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AMENDMENTS TO THE SPECIFICATION 



Please amend the paragraph beginning on page 19, line 26 as follows: 

Techniques for determining nucleic acid and amino acid sequence similarity are known in 
the art. Typically, such techniques include determining the nucleotide sequence of, e.g., an 
accessible region of cellular chromatin, and comparing these sequences to a second nucleotide 
sequence. Genomic sequences can also be determined and compared in this fashion. In general, 
"identity" refers to an exact nucleotide-to-nucleotide or amino acid-to-amino acid 
correspondence of two polynucleotides or polypeptide sequences, respectively. Two or more 
sequences (polynucleotide or amino acid) can be compared by determining their "percent 
identity." The percent identity of two sequences, whether nucleic acid or amino acid sequences, 
is the number of exact matches between two aligned sequences divided by the length of the 
shorter sequences and multiplied by 100. An approximate alignment for nucleic acid sequences 
is provided by the local homology algorithm of Smith and Waterman, Advances in Applied 
Mathematics 2:482-489 (1981). This algorithm can be applied to amino acid sequences by using 
the scoring matrix developed by Davhoff, Atlas of Protein Sequences and Structure , M.O. 
Dayhoff ed., 5 suppl. 3:353-358, National Biomedical Research Foundation, Washington, D.C., 
USA, and normalized by Gribskov, Nucl. Acids Res. 14(6):6745-6763 (1986). An exemplary 
implementation of this algorithm to determine percent identity of a sequence is provided by the 
Genetics Computer Group (Madison, WI) in the "BestFit" utility application. The default 
parameters for this method are described in the Wisconsin Sequence Analysis Package Program 
Manual, Version 8 (1995) (available from Genetics Computer Group, Madison, WI). An 
additional method of establishing percent identity in the context of the present disclosure is to 
use the MPSRCH package of programs copyrighted by the University of Edinburgh, developed 
by John F. Collins and Shane S. Sturrok, and distributed by IntelliGenetics, Inc. (Mountain View, 
CA). From this suite of packages the Smith- Waterman algorithm can be employed where default 
parameters are used for the scoring table (for example, gap open penalty of 12, gap extension 
penalty of one, and a gap of six). From the data generated the "Match" value reflects "sequence 
identity." Other suitable programs for calculating the percent identity or similarity between 
sequences are generally known in the art, for example, another alignment program is BLAST, 
used with default parameters. For example, BLASTN and BLASTP can be used using the 
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following default parameters: genetic code = standard; filter = none; strand = both; cutoff = 60; 
expect = 10; Matrix = BLOSUM62; Descriptions = 50 sequences; sort by = HIGH SCORE; 
Databases = non-redundant, GenBank + EMBL + DDBJ + PDB + GenBank CDS translations + 
Swiss protein + Spupdate + PIR. Details of these programs can be found on the internet at4he 
following intern e t address: http://www.nobi.nlm.gov/ogi bin/BLAST . When claiming sequences 
relative to sequences described herein, the range of desired degrees of sequence identity is 
approximately 80% to 100% and any integer value therebetween. Typically the percent identities 
between the disclosed sequences and the claimed sequences are at least 70-75%, preferably 80- 
82%, more preferably 85-90%, even more preferably 92%, still more preferably 95%, and most 
preferably 98% sequence identity to the reference sequence. 
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